Skip to content

Conversation

@Titus-von-Koeller
Copy link
Collaborator

@Titus-von-Koeller Titus-von-Koeller commented May 25, 2024

This PR to main serves the purpose to keep an overview of all the extensive changes that have been introduced to multi-backend-refactor to the iterative PRs around this topic.

This will not be merged into master and instead the changes will be ported to the new custom_ops API that's already merged to main. Future backend PRs should be addressed directly at main.

jianan-gu and others added 30 commits December 4, 2023 20:23
Enable igemmlt int test on rocm
@github-actions
Copy link

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

matthewdouglas and others added 5 commits February 10, 2025 15:40
* fix xpu dtypoe

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix nf4 dtype

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix version

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix setup version

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
jiqing-feng and others added 9 commits March 4, 2025 20:39
* enable benchmark script

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Small fixes to non_cuda_backends.mdx

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: Titus <9048635+Titus-von-Koeller@users.noreply.github.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* enable quant storage

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix to numpy

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix 4bit XPU dequant 4bit

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix default value

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix ipex linear set

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix ipex linear set to false when calling state dict

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix Int8Param device patch

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix xpu to cpu

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix xpu cpu data device

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
* fix intel cpu/xpu warning

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix error log

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix lib

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* rm return Nonr

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* error log only without ipex

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix import eerror

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
@anadon
Copy link

anadon commented Apr 14, 2025

Could someone post about the status/progress of this PR? Like a list of checked and unchecked known items to do.

Liangliang-Ma and others added 2 commits April 15, 2025 11:13
* enable xpu 8bit optim

* add deqaunt_blockwise

* dequantize_blockwise

* add bakcend synchronize

* refine code

* ipex dep

* ipex dep too

* ipex version check

---------

Co-authored-by: jiqing-feng <jiqing.feng@intel.com>
Authored by: Chetan Kumar Verma <chetan.kumar.verma@intel.com>
Co-authored-by: Ruheena Suhani Shaik <ruheena.suhani.shaik@intel.com>
Co-authored-by: Bhargav Eede <bhargav.eede@intel.com>
Co-authored-by: Vivek Goel <vivek.goel@intel.com>

Co-authored-by: Ruheena Suhani Shaik <rsshaik@habana.ai>
@Titus-von-Koeller
Copy link
Collaborator Author

Please see this short update about the multi-backend refactor #1596.

cc @anadon

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
jiqing-feng and others added 2 commits April 29, 2025 09:31
@Titus-von-Koeller Titus-von-Koeller changed the title (WIP) Multi backend refactor -> main (full diff of all already merged PRs) (reference-only) Multi backend refactor -> main (full diff of all already merged PRs) May 8, 2025
@Titus-von-Koeller
Copy link
Collaborator Author

We're closing this PR, because the functionality contained therein has been merged to main following the torch.library API as a integration mechanism. Therefore, this branch here will remain undeleted for a while, in case anyone still wants to reference it and we'll leave the build up, until the official release from main.

In the mean-time the already merged code can be used by downloading the wheel from CI:
https://github.com/bitsandbytes-foundation/bitsandbytes/releases/tag/continuous-release_main

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.